Generations of correlation averages 



GIOVANNI COPPOLA - MAURIZIO LAPORTA 

Abstract. The present paper is a dissertation on the possible consequences of a conjectural bound 
for the so-called modified Selberg integral of the divisor function ds, i.e. a discrete version of the classical 
Selberg integral, where da(n) = Yl a bc=n^ ^ s attached to the Cesaro weight 1 — \n — x\/H in the short 
interval \n — x\ < H. Mainly, an immediate consequence is a non-trivial bound for the Selberg integral of 
c?3 , improving recent results of I vie based on the standard approach through the moments of the Riemann 
zeta function on the critical line. We proceed instead with elementary arguments, by first applying the 
"elementary Dispersion Method" in order to establish a link between "weighted Selberg integrals" of any 
arithmetic function / and averages of correlations of / in short intervals. Moreover, we provide a conditional 
generalization of our results to the analogous problem on the divisor function dk for any fc > 3. Further, 
some remarkable consequences on the 2k— th moments of the Riemann zeta function are discussed. Finally, 
we also discuss the essential properties that a general function / should satisfy so that the estimation of its 
' Selberg integrals could be approachable by our method. 

m ' 

' 0. Libretto: introduction and statement of the results. 

In the milestone paper [S] Selberg introduced a determinant tool in the study of the distribution of prime 
f-H , numbers in short intervals [x,x + H], i.e. H — o(x) as x — > 00, namely the integral 
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where A is the von Mangoldt function defined as A(n) = logp if n = p r for some prime number p and for 

def 

some positive integer r, otherwise A(n) = 0. Thus, A is a weighted characteristic function of the prime 
numbers and it is generated by (minus) the logarithmic derivative of the Riemann zeta function, i.e. its 
\Q ' Dirichlet series is — £'(s)/£(s) . Further, being a quadratic mean, the Selberg integral precisely concerns the 

study of the distribution of primes in almost all short intervals [x, x + H] with at most o(N) exceptional 
integers x £ [N, 2N) as N — > 00. Here we define the Selberg integral of any arithmetic function / : N — > C as 



J f (N,H) d S J2\ f(n)-M f (x,H) 

x^N x<n<x-\-H 



where x ~ N means N < x < 2N and Mf(x,H) is the expected mean value of / in short intervals 
(abbreviated as s.i. mean value). In order to avoid trivialities, one assumes that the length H of the short 
interval goes to infinity with TV. In view of non-trivial bounds of such sums, it is easy to realize that the 
discrete version J\(N,H) is close enough to the original integral introduced by Selberg, so that we feel to 
be legitimate to use the same symbol for both versions. Such conditions hold for the arithmetic functions 
we work with and the typical case is the fc— divisor function dk for fc > 3, where dk(n) is the number of ways 
to write n as a product of fc positive integer factors (see [CO] and compare §3). Let us denote the Selberg 
integral of dk as 

j k (n, h) = y y,\ E dk w - Mk ( x -> H ^ 2 



x~N x<n<x+H 

with the s.i. mean value of d^ given by 

def 



M k (x,H) = J (P fe _! (log x) + P' k _ l (log x)) 



def 



where Pk-i is the residual polynomial of degree fc — 1 such that Pfc_i(log:r) = Res s= i Q {s)x s / s 
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The first author has proved the lower bound NH log 4 N < J 3 (/V, H) for H < TV 1 / 3 " 5 (see [CO]), while, in an 
attempt to establish a non trivial upper bound, both the authors have formulated the following conjecture 
for the so-called modified Selberg integral of d 3 , 

MN,H) ±' J2 | E (l - ^)^(-) - M s (*,fl)| a , 

where M 3 (x, if) is the same s.i. mean value of J 3 (/V, H). 

Conjecture CL. If H <£i TV 1 / 3 , tfcen J 3 (N,H)<g:NH. 
Here and in what follows for convenience we write 

A(N, H)4& B(N, H) whenever A(N, H) < £ N e B(N, H) Ve > 0. 

def 

Moreover, we adopt a further convention on bounds of the width of H, i.e. 9 = log Hj log N (more in 
general 9 is defined by x 6 <C H <C x 6 for a; ~ N): the inequality 9 > 9 (resp. < So) means that there 
exists a fixed and absolute constant 5 > such that > 9 + 5 (resp. < 9 — 6). In particular, < 9 < 1 
has to be interpreted a,s S < 9 < 1—5. 

As a consequence one has the following result. 

Theorem I. If Conjecture CL holds, then J 3 (7V, H)<m. NH 3 / 2 . 

Our theorem is an easy deduction by the general link between the Selberg integral J/ (TV, H) and the corre- 
sponding modified one (see §4) 

J f (N,H) d M J2 | J2 (l-^)f(n)-M f (x,H)\ 2 . 

x~N 0<\n-x\<H 

Noteworthily Theorem 1 implies an improvement on Ivic's results [Iv2] for d^ both in the bound and in the 
"low" range where it is valid: while Ivic's bound is non- trivial for N^ 6+s < H < TV 1-5 , i.e. for width 
1/6 < 9 < 1, ours is non-trivial for < 9 < 1/3, that is in the range N s < H < TV 1 / 3 . Further, we think 
that our estimates can be refined in order to get a better range for the width. We remark that, still assuming 
Conjecture CL, the first author [C5] has recently derived the better bound 

J 3 (N, H)<^NH e / 5 . 

Needless to say that our study applies to any divisor function dk, though the conjectured estimates of 
the modified Selberg integral, 

J k (N,H) | E i 1 - l - I1 Jf^)Mn) - M k (x,H)\ 2 , 

x~N 0<\n-x\<H 

become less and less meaningful as k grows. This is essentially due to the poor state of knowledge about the 
distribution of d k in long intervals, namely the known value of the exponent ot k such that (compare §3) 

(*) E dfe ( n ) ~ xPk-xilogx^x^ . 

n<x 

We generalize Conjecture CL for J k {N,H) with k > 3 as it follows. 

General Conjecture CL. Assume that a fe _i e [0, 1) in (*) for a fixed integer k > 3. If H < N 1 / 2 , then 
J k {N, H)<m: N 1 - 1 ^!! 2 + NH{N 1 - i ' k + jy* 1 - 1 /*)"*-!-!/*) . 
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Consequently, setting 9k — max (l — — , (1 — — jctk-i — , for every width 9 G (6k, 1/2] there exists an 
ei = si{9, k) > such that 

J k {N,H) < N 1 - 461 !! 2 . 

Similarly to the case k = 3, from the last inequality it follows our second main result. 
Theorem 2. If General Conjecture CL holds, then J k (N,H) < N 1 ^ 261 !! 2 . 

Again as a consequence one would get an improvement in the low range of H with respect to the results of 
I vie for the mean-square of dk in short intervals [Iv2]. In fact, Theorem 1 in [Iv2] holds for 9 E (9k, 1) with 
9k defined in terms of Carlson's abscissae (see §6). In particular, it holds for 64 = 1/4, 5 = 11/30, 6§ = 3/7, 
whereas we get non-trivial estimates for widths 9k < 9 < 1/2 with 64 = 11/128, 9 5 = 1/5, 6* 6 = 1/3. See §7 
for further details, where one finds the so-called k— folding method that is at the core of our conjectures. 

It is apparent from our study that generally Jf (N, H) fits the request of "smoothing" the Selberg integral 
Jf(N, H), both in the arithmetic and the harmonic analysis aspects. The arithmetic matter essentially relies 
on a simple observation going back to the Italian mathematician Cesaro around the end of the 19th century: 

0<|n-x|<H V 7 h<H\n-x\<h 

This is a kind of arithmetic mean of the inner sum in Jf(N, H) and somehow justifies the appearance of the 
same mean-value term in the modified Selberg integral. The analytic aspects of such a smoothing process 
will be better understood after the introduction of the correlation C/(ft) in §2, where it is showed that Selberg 
integrals of / are strictly related to averages of 6/ (ft) in short intervals |ft| -C H . 

The next corollaries testify such an intimate link and also conditionally improve recent results [BBMZ] 
and [IW] on an additive divisor problem for dk- More precisely, they concern the deviation of dk, i.e. 

B k (N,H) = f E Mn)d k (n - ft) - ^ ^ M k (n, H) 2 = £ G k (h) - H ^ p^Qogn) 2 , 

h<Hn~N n~N h<H n~N 

where Cfe(ft) d = dk(n)dk(n — ft) is the correlation of d k (see §2) and pk-i(\ogn) d = Mk(n,H)/H is the 

so-called logarithmic polynomial of dk(n). The following further consequence of Theorem 1 is proved in §6. 
Corollary 1. Let N,H be positive integers such that < 9 = log H/ log N < 1/3. Then 

B 3 (N, H)<^ NH 3/4 + N a3 H + H 2 . 

Analogously, from Theorem 2 we get a non-trivial estimate for the deviation of dk for any k > 3 (see §7). 
COROLLARY 2. Under the same hypotheses of Theorem 2 one has, in the same ranges and for the same S\, 

B k (N, H)<^. N 1 ~ ei H. 

The aforementioned link to [BBMZ] and [IW] results is due to the identity (see §3) 
p fe _i(loga;) = P k -i(\ogx) +P' k _ 1 (\ogx) = Res ( k (s)x s - 1 , 

s=l 

where one has to be acquainted that our notations Pk-i and pk-i are not consistent with those in [BBMZ] 
and [IW]. In particular, from equation (3.8) in [IW] it turns out that 

r-2N 

Hi pk-i(\ogx) 2 &x + £ (N 1+e ) 
Jn 
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is the main term in the formulas for sums of d k correlations established in [BBMZ] when fc = 3 and in [IW] 
for every k > 3. Since it is easily seen that 



/■2JV 

y~] p fe _i(logn) 2 - / p fc _i(loga;) 2 dx«§; 1 



n~N 

then for every H <C N it follows 

,-2N 



revealing that within negligible remainders <®CiV our H>k(N,H) is comparable with X^ft,<H Afc(-/V; h), that 
is the average of errors for d k correlations estimated in [BBMZ] when fc = 3 and in [IW] for every fc > 3. 
Thus, Corollary 1 and the best known a 3 < 43/96 (Kolcsnik, 1981) imply that © 3 (7V, H)<^. NH 3 / 4 for 
9 < 1/3, which improves [BBMZ] in the low range of short intervals ([IW] bounds are better for fc = 3 when 
9 > 1/2). In fact, their remainders total <gc 7V 13 / 12 \/If , that is worst than •^NH 3 / 4 ' when 6 < 1/3, and 
are non-trivial only for 9 > 1/6. Similarly, since Corollary 2 holds in the ranges prescribed by the General 
Conjecture CL, we get an improvement on the estimation of U>k(N,H) in the low range of short intervals 
with respect to [IW] bounds' non-trivial ranges, though they have better high range, say 1/2 < 9 < 1. 

The novelty of our approach is that, though conditionally, it leads to valuable improvements with 
respect to the analogous achievements obtained via the classical moments of the Riemann zeta function 
on the critical line 3?(s) = 1/2, while we think that the conjectures CL on the modified Sclbcrg integrals 
might be approachable by elementary arguments. On the other side, estimates of Jk(N,H) have non-trivial 
consequences on the 2fc— th moments of C (see [C2]) defined as 

dt . 



«r> = X \<(l + u ) 



Thus, at the moment we content ourselves in having found an alternative way to pursue possible improve- 
ments on the 2fc— th moments of £ at least for relatively low values of fc. Indeed, in §8 we take a glance 
at the effect of hypothetical estimates for Selberg integrals on the 2fc— th moments through Theorem 1.1 of 
[C2], whereas our conjectured J3 bound provides effectively the best known estimate for the 6— th moment 
of Riemann ( function after recent [C5]. In particular, in §8 we prove next result, which gives a link between 
conditional bounds of Selberg integrals Jk and bounds of Ik(T). 

Theorem 3. Let k > 3 be fixed. If J k {N, H)<m: N 1+A H 1+B holds for H < N 1 - 2 ^ and for some constants 
A,B>Q, then I k {T)^T 1+ ^ A+B )- B . 

Here, beyond the dependence on e, the constant involved in <^ may depend on fc. 

As an immediate consequence of Theorem 3 combined with the results in [C5], namely J 3 (N, H)<^ NH 6 / 5 
in the range H <C iV 1 / 3 , we get 

7 3 (T)«T 11 / 1 o . 

This encourages us to follow such a pattern and to pursue non trivial estimates for the modified Selberg 
integrals of d k in the future. 

Plan of the paper. 

§1 Beyond the aforementioned instances, further variations of the Selberg integral can be considered, ac- 
cording to the weight w suitably attached to an arithmetic function /. We give a very short introduction 
to the so-called w-Selberg integral of /. 

§2 We introduce the correlations of an arithmetic function / and of a weight w. For a wide class of 
arithmetic functions it is shown through the Dispersion Method that weighted Selberg integrals are 
strictly related to averages of such correlations (see Lemma 1). 
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§3 In an attempt to generalize further our results, we have abstracted the essential properties that an 
arithmetic function / has to satisfy so that its Selberg integrals can be approachable by our method. 
Mainly inspired by the prototype dk, we devote this section to the definitions and basic properties of the 
essentially bounded, balanced, quasi- constant and stable arithmetic functions. Inevitably in this analysis 
one finds references to the famous Selberg Class. 

§4 For a real, balanced and essentially bounded function /, the second and the third generation of corre- 
lation averages in short intervals correspond respectively to the Selberg integral and the modified one. 
Here we exploit further the properties of such functions in long intervals to outline a chain of implications 
that under suitable conditions from a non-trivial estimate for Jf (TV, H ) generates a non-trivial bound 
for Jf (TV, H ) . Applying these implications to the divisor functions one gets immediately Theorems 1 
and 2. Such a bound for J/ (TV, H) in turn becomes an effective mean to pursue a "good deviation" of /, 
i.e. an error term in the asymptotic formula for the first average of the correlation of / in short intervals, 
whereas, as we shortly recall in §5, the expected formula for the single correlation is just conjectural for 
most significant instances of /. 

§5 It is a short excursion on some very special cases of single correlations whose conjectured asymptotic 
formula? have been proved. 

§6 Here one finds the proof of Corollary 1, that follows rather easily through the general arguments of §4. 
Corollary 2 follows similarly although it is discussed in the next section. 

§7 At least in principle, the strategy applies also to dk for any k > 3 as an application of the general 
k- folding method that is described by Lemma 2. 

§8 As an application of Theorem 1.1 in [C2], we prove Theorem 3, that emphasizes the consequences of 
Jfc(TV, H) bounds on the moments of the Riemann zeta function on the critical line. 

§9 We call upon the Selberg integral of d 3 to address the last word on the best "unconditional" exponent 
for the 6— th moment, that becomes an immediate consequence of Conjecture CL, Theorem 3 and [C5] 
bounds. 

Some notation and conventions. 

If the implicit constants in O and <C symbols depend on some parameters like e > 0, then mostly we specify 
it by introducing subscripts in such symbols like O e and <C £ , while we avoid subscripts for <gc defined above. 
Notice that the value of e may change from statement to statement, since e > is arbitrarily small. 

The relation / ~ g between the functions /, g means that / = g + o(g) as the main variable tends to 
infinity typically. No confusion should be possible with the dyadic notation, x <~ TV, which means that x is 
an integer of the interval (TV, 2TV] , as already said. 

The Mobius function is defined as //(l) = 1, fi(n) = (— l) r if n is the product of r distinct primes, and 
fi(n) — otherwise. The symbol 1 denotes the constant function with value 1 and lu is the characteristic 
function of the set U. The Dirichlet convolution product of the arithmetic functions f\ and fi is 

(/i * h ) (n) = f h (d) h (n/d) Vn e N . 

d\n 

def 

In particular, we call = /*■■■*/ the k—fold Dirichlet product of the arithmetic function /. For any 

S v -' 

k times 

/, g : N — > C the Mobius inversion formula states that / = g * 1 if and only if g = f * fi, which is called the 
Eratosthenes transform of /. For example, 1 is the Eratosthenes transform of the divisor function d = 1 * 1. 
More in general, dk — 1 * ■ ■ ■ * 1 = lk is the k— fold Dirichlet product of 1 for k > 2. 

k times 

For simplicity, in sums like ^2 a<x it is implicit that a > 1. The distance of a real number a from the nearest 
integer is denoted by ||a| = min({a},l — {a}), where {a} is the fractional part of a. As usual, we set 
e(a) = e 2m , Va G M, and e q (a) = e(a/q), e N, Va e Z. 
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1. Preludio: weighted Selberg integrals. 

Given positive integers N and H = o(N), the w— Selberg integral of an arithmetic function / : N — > C is the 
weighted quadratic mean 

J wJ (N,H)= f £ \j2w(n-x)f(n)-M f (x,w)\ 2 , 

x~N n 

where the complex valued weight w has support in [—cH, cH] for some fixed real number c > 0, so that the 
inner sum is genuinely finite. The term Mf(x,w) is the expected mean value of / weighted with w in the 
short interval of length <C H and its dependence on w has to be specified according to the different instances. 
In particular, according to the study exposed in §3, when it is possible to define the logarithmic polynomial 
Pf(\ogn) we set 

Mf(x,w) d = ^2w(n - ar)p/(logn) = p f (\ogx) ^ w(h) + 0(i? 2 iV e_1 ) . 

n h 

Clearly, the weighted Selberg integrals include the most celebrated case of the original Selberg integral, since 

def 

J\(N, H) — J u .a{N, H), where u = l[i,in is the characteristic function of [1, H\. More in general, Jf(N, H) 
is the u— Selberg integral of /. The modified Selberg integral Jf(N,H), introduced by the first author in 
[CI], is recognizable as a weighted Selberg integral by taking the Cesaro weight, say 

<W.)*(i-g) + — (i-g.o). 

Since 

0<Jf-|t| a,6<H 
b — a = t 

where G u is the correlation of the weight u (see next §2), then we refer to the Cesaro weight as the "normalized 
correlation" of u. More in general, we smooth the weighted Selberg integral J w j(N,H) by defining the 
modified w— Selberg integral of / as 

J wJ (N,H) d ^ f J~ f (N,H) , 
where the new weight w is the normalized correlation of w, i.e. 

w{h) d = ^-^^w{n)w{m) . 

n m 
n-m = h 

Another important instance of the weighted Selberg integral has been intensively studied by the first author, 
i.e. the symmetry integral of / given by 

J SS nj(N, H) d = f J2 | J2 s ^ n ~ x )/( n )f ' 

x~N 0<|n-a;|<_H" 

def def 

where sgn(0) = 0, sgn(t) = \t\/t for t ^ 0, and Mj{x, sgn) vanishes identically for every /. The study of the 
symmetry integral has been motivated by the work of Kaczorowski and Perelli (see [KP] ) , who were the very 
first to exploit a strict relation of the classical Selberg integral with the symmetry properties of the prime 
numbers. Indeed, in [C], [CS], [C3], [C4], the symmetric aspects of the distribution of several samples of 
arithmetic functions in short intervals are studied through the analysis of the associated symmetry integral. 

It is worthwhile to exploit the link between J sgn ,/ and the modified symmetry integral J sg nj in future 
papers. As in the case of any odd weight w, they demand the s.i. mean values to vanish. On the other side, 
note that the normalized correlation of any w is even and inside Mf (x, w) we have J2 h w(h) = \^2 a w(a)\ /H. 
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Finally, some considerations in §3 make it plausible that a satisfactory general theory, for the weighted 
Selberg integrals, may be built within the environment of the Selberg Class. 

2. Ouverture: weighted Selberg integrals as correlation averages. 

By correlation of an arithmetic function / : N — > C we mean a shifted convolution sum of the form 



def 



where the shift h is an integer such that \h\ < N. Observe that one might take into account only the 
restriction of / to 1 < n < 3N. Further, a correlation of shift h is essentially a weighted count of the integer 
solutions n,m £ (N, 2N] of the equation n — m = h , as 



C/W= EE /(»)/("») + 0(\h\- max \f(£)\ 2 ) . 



a=h 



In the present context it is convenient to define the correlation of a weight w by neglecting the O-term, i.e. 



a b 
b — a = h 



The reason of such a different definition will be clarified after next lemma, where we prove a strict connection 
between correlations and weighted Selberg integrals by applying an elementary Dispersion Method. 

Lemma 1. Let N,H be positive integers such that H — > oo and H = o(N) as N — > oo. For every uniformly 
bounded weight w with support in [— cH, cH] and every arithmetic function f one has 

J wJ (N,H)= E e w (h)e f (h)-2^[j2f^J2 w(n-x)M f (x,wj) + Yl \M f (x,w)\ 2 + (HWJ , 

0<\h\<2cH n x~N x~N 

where \\ f \\oo d = max \f(n)\ . 

N-cH<n<2N+cH ' V U 

Proof. By expanding the square and exchanging sums one gets 

j wJ (n,h) = e ( E w ( n - - M /( x ' w )) ( E w ( m - x ) • 7m - M /( x > w )) = 

x^N n m 

= E E 7M E w ( n - s ) w ( m - x ) - m ( E / W E Mri-^M/^w;)) + E \Mf(x,w)f . 

n m x^N n x~JV x~N 

Thus, it suffices to show that 

E /(») E7M E - x)w(m - x) = E e w (h)e f (h) + O (H 3 \\f\\l) , 

n m x~N h 

where we may clearly assume that a d = m — x,b d = n — x e [—cH, cH). Consequently, we write 

E-^ n )E^ m ) E w(n-x)w(m-x) = E EE-^ n ^( TO ) EE w(b)w(a) . 

n m x~N \h\e[0,2cH] „"_ m ™ h a,be[-cH, C H] 

n-b = m-ae(N,2N] 

Since the condition n-i = m- a£ (N, 2N] is implied by n, m e (iV + off, 2N — cH], then the latter is 

E EE i»7M EE 

|/l|e[0,2cH] ».»E(If+dJ,2»-tB] a,f>e[-cH, c H] 
n. — m = h b — a = h 



+°(*n/n a ~ E (E E i+E E i)) = 

|/l|<E[0,2cff] " m e(JV-cff,N + <:ff]u(2JV-cff,2JV + c ff] m Jie(N-cff,JV + cff]u(2N-cff,2JV + cff] 

= E ( EE /m7m + EE /m/m) EE 

|/l|G[0,2cH] n~JV,me(JV + c-ff,2iV-cJf-] ne(N + cH,2N-cH],m~N a,b(=[-cH ,cH] 

n — m = h n — m = h b — a = h 

+o(ff|i/iiLE EE O+o^ii/iilE EE i) = 

"ie(Af,Af + c/J]u(2iV-c/f,2Af] h<giH m€(N-cH,N + cH]u(2N-cH,2N + cH] 

n — rn = ±h n — m = ±h 

= E EE /w/h EE ™(b)Aa) + o(H\\f\\i J2 E x ) = 

0<\h\<2cH n~N m ~N -eH<a,!><cH h«.ff m£ ( N-cH, N+cH] U <2N-cH,2 N+cH] 

n — m = h b — a = h 

= E (e / W+o(||/|| 2 00 l^l))e.(M+o( J ff 2 ||/||L(2 Cj ff + i))= E e w (/oe,(fc)+o(ir 3 ||/||L). □ 

0<|h|<2cff 0<|/i|<2cff 

Remark. The remainder term CK-ff 3 !!/!)^) is essentially due to the estimate of "short" segments of length 
<C H within "long" sums of length ^> TV. We refer to these short segments as the tails in the summations. In 
order to simplify our exposition, the symbol (T) within some of the following formulae will warn the reader 
of some tails discarded to abbreviate the formulae. 
Thus, by using the exponential sum 1 



fifi) = E /(»Mn0) - 



def 

we write 



e/W= ££/(n)/(m) + 0(ll/l£,W) = f 1 \fW)\ 2 e(-hP)dP + 0(\\f\\l\h\)W f 1 \f(P)\ 2 e(-hP)dP . 

m ~Nn~N JO JO 

n-m = h 

An important aspect is that the exponential sums, whose coefficients are correlations of a weight w, are 
non-negative. More precisely, 

€ w {p) =J2e w (h)e(hp) = £££ W (6M^)e(/ i /3) = I 5>(n)e(n/3)| 2 = \w(f3)\ 2 V/3 e [0,1) . 

h h b—a—h n 

In particular, for the correlations of u — one gets the Fejer kernel 

ei(J3) = \u((3)\ 2 = |£ u (n)e(n0) * = I £ 



n<H 



More in general, the Fejer-Riesz Theorem [F] states that any non-negative exponential sum is the square 
modulus of another exponential sum: 

w(/3)>0 V/3e[0,l) <^=> 3v : w{f3) = |w(/3)| 2 V/3 e [0, 1) . 

A particularly easy instance of this theorem follows by recalling that the Cesaro weight is the normalized 
correlation of u, i.e. Cn(h) = G u (h)/H (see §1). Hence, again Fejer's kernel makes its appearance in 



E(i-77j e(W) = -iE e «(W) = 



+ h 



1 Apart from j3 sign, is also-called the discrete Fourier transform of /. Hereafter we will not specify 
that it is a finite sum. 
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We also use to say that the Cesaro weights are positive definite. We think that basically such a property makes 
the aforementioned smoothing process work for the modified Sclbcrg integral under suitable hypotheses on the 
function / (see §3), while for the classical Selberg integral J/ (AT, H) and the symmetry integral J sgn j(N, H) 
it is plain that "«" and "sgn" are far from being positive definite weights. One could exploit such a positivity 
condition in order to prove a non-trivial result for the modified Selberg integral of the divisor function dk- As 
a general strategy, from a non positive definite weight w with support of length at most H one could call for 
its normalized correlation w — Q w /H generating the non negative exponential sum Q w ((3)/H = \w((3)\ 2 /H. 

3. Starring: essentially bounded, balanced, quasi-constant and stable arithmetic functions. 

The wide class of arithmetic functions under our consideration consists of functions bounded asymptotically 
by every arbitrarily small power of the variable according to the definition 2 : 

de f 

f is ESSENTIALLY BOUNDED Ve > f (n) <C e if , 

that we shortly denote by writing /<^C 1. However, in several circumstances one has to deal with arithmetic 
functions having support in an interval of length <C A?" or simply with functions restricted to such an interval. 
Thus, more in general among the essentially bounded functions we include / such that f(n) <C £ N e Ve > 0. 
A well-known prototype of an essentially bounded function is the divisor function dk, whose Dirichlet series 
is C(s) k . Similarly, the Dirichlet series 

n—l 

is defined at least in the right half-plane 5R(s) > 1, whenever the generating function / is essentially bounded 
(say the abscissa of absolute convergence is a ac < 1). Recall that through Perron's formula the expansion of 
F at s = 1 leads to an asymptotic formula for the summation function from 

1 rc+ioo s 

97 / F(s)-ds (x?N), 

n<x £m Jc - l °° S 

where c > max(0,er ac ). More precisely, with the aid of further properties of the Dirichlet series and the 
Residues Theorem, such an asymptotic formula becomes 

^/(n)=Res F(s)^+R f (x) , 

where Rf(x) is an error term as long as it is smaller than the main term. Assuming that F is meromorphic 

and denoting the polar order 3 of F by mp d = ord s= i F, the main term is more explicitly written as (compare 
[De]) 

xP f (logx) d =Res F(s)— , 

s=l s 

where Pf is the residual polynomial of /, which has degree mp—X, while it vanishes identically when mj? < 1. 
For the remainder term a good estimate would be (compare [De] again) 

Rf{x)<^x a ^ 

with a suitable < a(f) < 1 (negative values of exponent ce(f) are possible, but discarded as "meaningless"). 
This is the case for any divisor function d k - Indeed, from (*) of §0 one has 

^d fc (n)=ar.P fc _i(loga:) + A fc (a;) with A k (x)<m:x ak , 

2 That is / satisfies one of the Selberg class axioms, the so-called Ramanujan hypothesis (see esp. [De]). 

3 That is the order of the pole of F at s = 1. 
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where the degree of the residual polynomial Pk-i (see §0) is fc — 1, because the polar order of £ fc is rrik = k, 
and ak < 1 — 1/fc is what one can infer inductively from the elementary Dirichlet hyperbola method applied 
to the first case k = 2. More precisely, one has Afc(x) <C x x ~ x l k log fc ~ 2 x. Now, by partial summation it is 
easy to determine the logarithmic polynomial Pk-i such that 

^Pfe-i(logn) = .xP fe _i(loga;) + 0(log fc_1 x) . 

n<x 

Thus, we get the decomposition 

def — 

dfc(n) = pfe-i(logn) + d k (n) , 

that is a "balancing" of dk(n) from which the very slowly increasing polynomial pfc-i(logn) is subtracted. 
The Dirichlet series generated by dk times x s /s has zero residue at s = 1. Moreover, (*) is equivalent to 

(?) ^4(")<«x Qfc . 

n<x 

This invites to formulate the following definitions 4 : 

f is balanced Res — F(s) = Res — > = 0, Va; 

s=i s w s=i s ^ n s 

n=l 

(that is F has an analytic continuation in s = 1, because < 0), 

/ is well-balanced OF exponent a ^ f{n)<<^x a for some a e [0, 1) . 

n<x 

Of course, any well-balanced function is also balanced because the previous bound implies that the Dirichlet 
series is regular at s = 1. However, the converse needs not to be true, as A(n) — 1 is a balanced function, 
but the existence of an exponent a < 1 is still far from being proved (see some further comments below). 

An essentially bounded arithmetic function a : N — > C is said to be QUASI-CONSTANT if there exists 
A e C 1 ([l, +oo), C) such that 5 A'(t)<^l/t and A\^ = a. Clearly, the logarithmic polynomial pk-i (log n) 
is quasi-constant with respect to n and this, together with the fact that dk is a well-balanced arithmetic 
function of exponent a k , suggests the following further definition. 

An arithmetic function / is stable of exponent a if there exist a quasi-constant function a and a 
well-balanced function b of exponent a such that / — a + b, while the AMPLITUDE of / is defined as 

def 

a(f) = inf {a e (0, 1) : / is stable of exponent a} . 

Recall that the Dirichlet divisor problem requires to prove the conjectured amplitude ot2 — a(d) = 1/4, 
while one infers ct2 < 1/2 by the Dirichlet hyperbola method and the best known bound at the moment is 
d2 < 141/416 (Huxley, 2003). In what follows, ctk — a(dk) is the best possible exponent in (*) and (*). 

According to Ivic [Iv2], the mean value in the Selberg integral of any arithmetic function /, whose 
Dirichlet series F(s) converges absolutely at least in the half-plane 3?(s) > 1 and is meromorphic in C, has 
the analytic form given by 

M f (x, H) = f H (P f (log x) + P' f (log x)) , 

Although with a different meaning, such a terminology has been coined by Ben Green and Terence Tao. 
Mainly, Green [Gr] calls balanced a function / — 5 when / is a characteristic function of a set with density 5. 

5 The condition on the derivative A' implies that A is essentially bounded, provided a depends only on 
n. However, we leave open the possibility that a and A might depend on auxiliary parameters. 
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where Pj is the derivative of the residual polynomial of /. We remark that Mf(x,H) is linear in / and is 

def 

separable, i.e. the variables H, x are separated. Recall that pf = Pf + P'j is the logarithmic polynomial. 

Philosophically speaking, although completely justified from an analytic point of view (using mean- value 
theorem and derivatives bounds), such a choice of Mj(x,H) is not satisfactory, since one should expect to 
find the same Selberg integral for / and its balanced part / = f — pf. Indeed, this is the case whenever we 
define the s.i. mean- value as (see §1) 

P/(logn) = Hp f (log x) + O e (N'H 2 /N). 

x<n<x+H 

Of course, up to negligible remainders this is still possible under ivic's hypothesis. Let us give an idea of a 
possible extension of these considerations to the case of a more general function /. 

Bearing in mind (*), given any arithmetic function /, we call a polynomial Pf such that 
(*)/ ^ f(n) =xP f (log x) + O s (x E+a ) witha<l 

n<x 

the residual polynomial of /, although Pf is not necessarily defined from the residues in s = 1 with the 
Dirichlet series F(s). Then, let us define the logarithmic polynomial of / as 

P/ (logx) d = f A (xP f (log x)) = Pf (log x) + P' f (log x) . 

Under Ivic's hypothesis on / this immediately yields 

M f (x,H) = Hp f (log x) 

and ^ 

P/(loga;) = Res F(s)— + Res F(s) ^ - ^ = Res F(s)x s ~ 1 . 

s=l S s=l S s=l 

In the case / = dk this property allows us to to compare [IW] results to ours (see §0). 

On the other side, by assuming the sole property (*)/ one gets a unique polynomial pf such that 

5^ P/ (logn) = J Pf(logt)dt + 0(log c x)=xP } (logx)+0(log c x)~Y;tt n ) - 

n<x ^ n<x 

where c > is the degree of pf. Since pf(logx) is a quasi-constant function, this implies that 

/(«)~ PfQogn) ~ Hp f (logx) . 

x<n<x+H x<n<x+H 

Thus every arithmetic function / satisfying (*)/ admits a logarithmic polynomial pf and the analytic form 
of the mean-value in short intervals, inside the w-Selberg integral, of such a function / is 

^w(n-x)p f (\ogn) = Pf (logx)J2w(h) + 0(H 2 L c - 1 /N), 

n h 

def 

where L = log N hereafter. In particular, the analytic form of the mean value in the Selberg integral of c?3 
is explicitly given by 

M 3 (x, h) = h(P 2 (logx) + P^logx)) =h(^ log 2 x + 3 7 logx + 3 7 2 + 3 7 i) , 
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with P2{t) = t 2 /2 + (37 — l)t + (37 2 + 371 — 37 + 1), where 7 is the Euler-Mascheroni constant and 71 is a 
Sticltjcs constant defined as 

j<m ' 

Recall that this is also related to the summation formula [Ti] 



J2 d 3( n ) = xP z ( lo S x ) + 0(x 2/3 log x) . 



n<x 



From the application of the 3— folding method (see §7), recalling that x <~ N, it comes out that the arithmetic 
form of the mean value in the Selberg integral of d 3 is (here M = [(N — h) 1/>3 ], compare §7) 

q<fj H d l <M d 2 < 1I f M d<M 

Indeed, let us show that M 3 (x,h) can be replaced by M 3 (x,h) within J 3 (N,H) at the cost of a negligible 
error term. More precisely, we prove that 

M 3 (x, h) - M 3 (x, h)<m: hN- 1/3 uniformly Vz ~ N . 

At this aim, we apply Amitsur's formula [A] with Tull's error term [Tu] 6 , i.e. 

E^ = ^ + 2 7 logg + (7 2 + 2 7l ) + o(^), 

q <Q q ~ V VW 7 



to get 



E ~ = \ Xo ^ x - (log M) log a; + ^ log 2 M + 2 7 logx - 2 7 logM + ( 7 2 +2 7l ) + 0(N~ 1/3 ) . 

1<-M 



From the standard asymptotic formula for E ^ one has 



d 

d<D 



") E X E j- = (logM + 7 )lo ga ; + 7 2 -log 2 M-^^+7 1 + 0(7V- 1 /3L) , 



Hi) ( E 3) = lo S 2 M + 2 T lo § M + 7 2 + 0{N-^ 3 L) . 

d<M 

Thus, i), ii) and in) imply the claimed inequality, because 

= \ log 2 x + 3 7 logx + (3 7 2 + 37!) + 0(N^ 3 L) = + o.^-i/Sj . 

Such a proximity of the two terms M 3 (x,h) and M 3 (x,h) suggests that, even for more general essentially 
bounded function /, one should expect the same mean value term Mf(x, h) in short intervals for the Selberg 

6 Amitsur derived a symbolic method to calculate main terms of asymptotic formulae. Tull, a student of 
Bateman, gave a refined partial summation, that allows here to transfer error terms from the formula for 
So<Q d ('?)' l lkc Dmchlet's classical 0(s/Q), to this formula for Y,„<q 

d(q)/q. 
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integral and for the modified one, whenever a suitable arithmetic form Mf{x, h) w h g(q)/q, with f = g*l, 
is proved to be sufficiently close to the analytic form (determined by the residues of the Dirichlet series 
generated by /). This seems to be reliable at least when Mf(x,t) is separable, i.e. Mf(x,t) — tMf(x), 
with t — o(a;)and A4f(x) is a slowly varying function with respect to x (namely, a small A4'f(x) like the 
x— derivative of polynomials in the variable logx). Indeed, the identities 

E ( 1 -^X^)/( n ) = ^E E /(») - E /(«)«M / (x,2m-l) = (2m-l)M / ( a; ) 

0<|n— x|<7i m</lO<|ra— x|<m 0<|ri— ir|<m 

imply together 

E ( X - M \ E ^/(^2m - 1) = ftA4/(a;) = M f (x, h) . 

0<|n — x\<h m<h 

We refer the reader to the further discussion in §7 for the generalization to any divisor function dk through 
the so-called k— folding method. Of course, dk is not the only function suitable for (*)/. For example, De 
Roton [De] has showed that, if the Dirichlet series F belongs to the so-called Extended Selberg Class (ESC) 

deg F — 1 

with deg F > 1, then (*) f holds with a = — — — — ^ . Further, if / is also multiplicative (so that its Dirichlet 

series F has a suitable Euler product) and essentially bounded, then F belongs to the special subset of ESC, 
called Selberg Class (see [KP(012)]). Hence, according to our definitions De Roton's result (after [KP(012)] 
breakthrough on Selberg Class) becomes: 

/ has Dirichlet series in the Selberg Class with degree d > 1 =>■ / is stable of exponent — — - . 

d + 1 

In particular, this applies to any dk, since every power ( k for k > 1 belongs to the Selberg Class with 

k — 1 

degC fc = k. However, the De Roton exponent is weaker than the one obtained by other methods. 

k + 1 

Actually, the bound Uk — oi(dk) < 1/2 for k < 4 assures that the function dk for k < 4 admits (at least) 
square-root cancellation for the error terms, a property shared by every stable arithmetic function with a 
sufficiently small amplitude. This motivates the following further definition: 

def 

f is random -^=4 / is stable of amplitude a(f) < 1/2 . 
For example, it is well-known (see [D]) that the Riemann Hypothesis (RH) is equivalent to the inequality 

R A (x) = f ^(A(n) - 1)«< x/i , 

that is to say the von Mangoldt function is random 8 . On the other side, it is also well-known that uncon- 
ditionally the inequality Ra(x)^ x a holds only if a = a(A) > 1/2. In other words, A cannot be stable of 
exponent a < 1/2 . Such a circumstance is better expressed by the definition: 

de f 

f is strictly random <=> f is stable of amplitude a(f) = 1/2 . 

Thus, RH is equivalent to say that prime numbers are strictly random. The existence of a(A) < 1 corresponds 
to a quasi- RH because of another well-known analytic property of the prime numbers 9 : 

a(A) = sup{^ : ((/? + i-f) = for some 7 ^ 0} . 



7 The degree of F in ESC is defined in terms of its functional equation (see [De] for details). 

8 It is also a well-known fact the equivalence between RH and the randomness of the Mobius function fi, 
suggesting that /x behaves like A — 1. We refer to [IwKo] for further details on the Mobius randomness law. 

9 Recall also that a(p) = a (A). 
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The Polya- Vinogradov inequality (see [D]) provides a non-conditional example of a strictly random 
arithmetic function, namely any non-principal Dirichlet character %(mod q), since it yields 

n<q 

which is known to be essentially optimal (compare [Go] & [Te]). The actual results and the expected values 
ctfc = (k — l)/(2fc) in general (see [IvO]) reveal that the k— divisor functions arc not strictly random. 
Returning back to our integrals, if / is real and essentially bounded, then from Lemma 1 we have 

j f (N,H)^J2e u (h)e f (h)-2j2f(n) J2 u ( n - X ) M A X > H )+ E M /(^) > 

ft n x~N x^N 

J f {N,H) VJ2 e e u/H (WfW - 2 E/W E eu(n ~ X) M f (x,H) + £ Mj(x,H) . 

ft n x~N ' x~Af 

When / is also balanced, then Mf(x,H) vanishes identically. Consequently, 

J f (N,H)WJ2Zu(h)e f (h) and J f (N, H) ( ~> £ Q &JH {h)Q f {h) . 



4. Story: smoothing correlations by arithmetic means. 

Recalling that 

e/W ~ / \KP)\ 2 e{-hp)df3 with f{p) = /W e (^) . 

Jtt N<n<2N 



one easily infers 



(i) -£u(h)e f (h)= f 1 \m\ 2 u(-p)dp+o(H"\\f\\i)^ f 1 \m\ 2 u(-p)dp 

(ii) ^e„(/ l )e / (/ l )= / 1 |7(/ 3 )| 2 ^(-/ 3 ) d / 3 + (^ 3 ll/H-) s f \fW)\ 2 \m\ 2 W 
( //J ) E e e„/HWe/W-^\f(/3)r%^d/? + o(i/ 3 ||/||L) < - ) ^ 1 



2 



/(«| 2 |^)| 2 -^ 



ft 

In particular, for every balanced real function /<gc 1, from the previous section we get 

|4 



J f {N,H)W [ \f(f3)\ 2 \u(/3)\ 2 d/3 and J f (N,H)W [ \f(J3) 
Jo Jo 



2 



«(/3) 



-d/3 



if 2 

Formulae (I), (II) and (III) correspond respectively to the following iterations of correlations' averages. 
1st generation : ^2 6/ (ft) = y\(ft)C/(ft) (sums of correlations) 



ft<ir 



2nd generation: ^ ^ S/(a) = ^ e„(ft)C/(ft) (double sums) 

ft<ffO<|a|<ft ft 



3rd generation: E E E ^"^^ M ^ 2 ^ C/(ft) = E /ijW^/W (average of double sums) 

ft /l2 — ftl— ft 
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Such an obstinate process of averaging is motivated by the fact that it is rarely possible to find an asymptotic 
formula for the single correlation C/ when / is a significant arithmetic function. As we already said, the 
correlation of / counts the number of ft,— twins not only when / is a pure characteristic function (the von 
Mangoldt function is a typical case). In general, the underlying Diophantine equation is a binary problem 
that is out of reach with the present methods (see next §5 for some details). On the other side, the higher 
is the degree of a generation of the correlations' averages, the smoother are such averages and consequently 
we have more hope to get non-trivial asymptotic estimates. However, even at a 2nd generation level this 
hope is quite frustrated by the lack of efficient elementary methods 10 to bound directly the Selberg integral. 
Indeed, Ivic [Iv2] applies Riemann zeta moments since the Selberg integral of dk has a strong connection with 
them (see §8). Further, it is interesting to analyze the cost of the loss when a non-trivial information on the 
correlations' averages at some nth generation level is transfered to the averages of the (n — l)th generation. 
For example, assuming that / is real, essentially bounded and balanced, from the trivial inequality u <C H 
one immediately has 

Jf{N,H)<^J f {N,H)+H 3 . 

Then, in order to obtain an inequality in the opposite direction, we appeal to the formulas deduced from (II) 
and (III) and write 



J f {N,H)<m:J o |/(/3)| 2 |u(/3)| 2 d/3+ff 3 «<yA^ | /(/3) | 2 1 u(f3) | 4 d/3+H 3 <^ sj NH 2 Jf (N, H)+VNH 5 / 2 +H 3 , 

where we have applied the Cauchy inequality and the Parseval identity (with /<gc 1). Thus, if for some H 
one has a non-trivial estimate of the kind 

J f (N, H) < NH 2 /G 

with some gain G — > oo, then the previous formula implies (for the same range of H) 

J f {N,H)<m.NH 2 G- 1/2 + N 1 ' 2 ^' 2 . 

This gives Theorem 1 by taking G = H in Conjecture CL. 

Hence, by the sole application of the Cauchy inequality a third generation gain G <C N/H leads to the 
gain y/G for the second generation estimate. We say that the exponent's gain has halved. 

The same phenomenon occurs for a general weight w with the alternative approach that we describe 

here assuming J wJ {N, H) < NH 2 /G. By taking E d = {/3 : \w(/3)\ > eH} wc have that the following 



f\W)\ 2 \w{P)\ 2 a^e 2 H 2 / |/(/3)| 2 d/?+l / |/ (; 3)| 2 .^ld/3 

JO J[0,1)\E £ Je n 



« NH^MW 2 ^ 2 + ±J wJ (N,H) « NH 2 \\f\\l(e 2 + G~ 1 e~ 2 ) 



is an optimal bound when e = G and we get the same halving of the exponent's gain as before. 
Now, recallling that \u{0)\ > H when < l/(2H) (sec [D], Ch.25), we get also 

H 2 i f(J3) d/3« / i f{P) .L^lLdl3< \f((3)\ 2 .L^lLdf3^J f (N,H)+0 E (N^H 3 ), 

that is a modified version of Gallagher's Lemma [Ga, Lemma 1]. In order to establish the aforementioned 
estimate of J3 the first author in [C5] applies such a version of Gallagher's Lemma together with the following 
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This will be coped by our forthcoming paper on mean-squares in short intervals (w— Selberg integrals). 
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further property of the essentially bounded and balanced functions; namely (see [C5] for the proof), if, for 
an absolute constant A e [0, 1) and for a fixed S e (0, 1/2) one has N s < H x < i? 2 < TV 1 / 2 -" 5 and 



J f {N,H)<^NH 1+A , VH € [H U H 2 ] , then J f (N,H)<^NH 1+1 ^, VH < H 2 . 

What about the trade of information from the second generation to the first? 

Let us take / = a + b real and essentially bounded, with b balanced. Then, we write 

E e /W ~ E E /(»)/(» + ft ) = E /(*) E /m - 

h<H h<Hn~N x~N x<m<x+H 

= J2-f( x ) E +£/(*) E 6 ( m )' 

ir~jV x<m<x+H xr^N x<m<x+H 

Again by the Cauchy inequality one has 

£/(*) X! Km) « ( E l/WI 2 E | E /(n)-M / (x,ff)| 2 ) V ViV 1 /2 1 / / ( i r,ir) 1 

x~A r x<m<x+H x~N x~N x<n<a;+H 

where the mean value is 



Mf(x,H) = J] a(n). 



de/ 

Hence, we get exactly an asymptotic formula with main term 



E f( x ) M f( x ' H ) > 

whenever the tails are negligible and mostly the remainder term 

£/(*) ^ bim^NWjj&H) 1 ' 2 

x^N x<m<x+H 

turns out to be sufficiently small after halving the exponent's gain on Jf(x,H). 
Moreover, when a is quasi-constant, one has Mf(x,H) ~ a(x)H, that implies 

53 £/(/>) £' H 53 a(x) 2 + ff 5^ a(x)b(x) +O e (N 1 ^J f (x,H) 1 / 2 ) ^ 

h<H x~N x~N 



r2N 

~ 5 # / a(t) 2 dt + ff V a(z)&(z) + O e (iV 1/2+£ J/(x, i/) 1/2 ) 



In particular, if / is stable of exponent a, then by applying partial summation to ^ N a ( x )b(x) one 
definitively gets the asymptotic inequality for the deviation of /, say, 



f (N,H) d = 53 G f {h)-Hj2 a{x) 2 <^NHG- 1/4 + N a H + H 2 



def 

h<H x~N 



whenever a non-trivial estimate for the second generation, Jf(x, H) -C NH 2 G 1 / 2 , holds for some ranges of 
short intervals width, large enough in terms of the exponent a. 

Hence, we find a possible general chain of implications of non-trivial estimates as 

J f (N,H)^:NH 2 G- 1 => J f (N,H)<^NH 2 G- 1/2 => D/(iV, £f)<gC NHG -1 / 4 . 
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The exponent's gain halves at each step, but if it remains a neat positive one, then we say that / is stable 
through generations. In §6 complete calculations are supplied for the case of the divisor function 03, while 
in future papers we are going to explore the hardest case of stable arithmetic functions (eventually) having 
no logarithmic polynomial. 

5. Toccata e fuga: the rare cases of asymptotic formulae for correlations. 

An expected asymptotic formula for the single correlations of / usually takes the form (compare [BP]) 

e f (h) = e f (h)i f (N) + n f (N,h) , 

where the product of the so-called singular series & f and the singular integral If constitutes the main term, 
while lZf(N, h) has to be proved an error term. Such terminology is customary within the Circle Method, 
that was originally introduced by Hardy, Littlewood and Ramanujan in 1918-20 to attack several additive 
Diophantine problems. One of the most famous and maddening problem is the infinitude of the pairs of 
2h—twin primes (n, n — 2h) that can be formulated in terms of the correlation of the von Mangoldt function: 

e A (2h) = £ A(n)A(n-2/i) 

Indeed, with the aid of the powerful analytic tools, Hardy and Littlewood predicted that for every sufficiently 
large N one has 

G A (2h) = & A (2h)N + K A (N, 2h) 

with a certain singular series & A (2h) » 1 and a conjectured remainder term TZ A (N, 2h) <C A(log N)~ A for 
every constant A > 0. Note that such an asymptotic formula would imply the infinitude of the 2h— twin 
primes, but nowadays nobody has yet proved such a conjecture. 

Apart from very special cases of functions or some trivial instances of the correlations (as they could be when 
h = 0), the lack of asymptotic formulae involves the correlations of the most significant arithmetic functions 
in number theory. One of the exceptional cases is given by the well-known binary additive divisor problem, 

e d ( ) - £ d ^ d ( n °) 

where d is the divisor function. This problem was known at least since [E] time and has been studied mainly 
through the consolidated theory of modular forms on SL(Z,2) (see [IwKo], [Vi]). By adapting Motohashi's 
results [Mo] to the problem in short intervals, namely 7^ a = o(N), one has 

e d ( a ) = e d (a)N + U f (N,a) , 

where 

6 d (a) = 6 d (a,logA0 & ' £(log ^ £ £ ^ , 

i=0 j=0 d\a - 1 

TZ d (N, a)<« A 2 / 3 + JVV2| o |9/40 + | o |7/i0 

(here these absolute c£ • are not Motohashi's constants [Mo, p.530]). In search of remarkable improvements 
on the latter formula, one has to be content with an extensive literature on moments of lZ d (N, a) (see [IM]). 
The general case of the additive divisor problem for dfc, i.e. establishing an asymptotic formula for 

Cfc(a) d = £ d k {n)dk{n - a) , 

is much harder and still unsolved when k > 3. Even the basic case of 63(a) seems to be hopeless with present 
technology due to the poor knowledge about the structure of SL("Z, 3). We address the interested reader to 
[C2] for a very short tale about this fascinating problem. Moreover, we recall that the Dispersion Method of 
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Linnik (see his book [L]) made its appearance to attack these kind of binary problems. In §2 we show only 
an elementary version of the Dispersion Method, while we have to mention the beautiful paper [Iw] as an 
example of a non-trivial application of it. 

In the modular forms environment the correlations are widely known as Shifted Convolution Sums (see [Mi]). 
In such a context, the possibility of establishing asymptotic formulae depends directly on the same structure 
of the modular forms. This is particularly successful for the special class of arithmetic functions given by 
the Hecke eigenvalues X(n). For example, Conrey and Iwaniec [Colw] provide an asymptotic estimate 

oo 

e x (h)~Nj2c q (h)p(<i) 2 , 

9=1 

where c q {h) is a Ramanujan sum, while we refer the reader to the quoted paper for the definition of p(q) and 
the intricate remainder term. It is remarkable that in a joint and unpublished work with Iwaniec the first 
author has easily deduced from the aforementioned formula of [Colw] non-trivial bounds for the symmetry 
integral of the eigenvalues A (essentially square-root cancellation) . Other spectacular advances on bounding 
the correlations of the Hecke eigenvalues have been achieved by Holowinsky in [Hoi], [Ho2] and then applied 
jointly with Soundararajan in [HoSo]. 

Returning to more familiar functions as dk, it seems that the only hope remains the Large Sieve and all 
the methods strictly related to such an inequality. In fact, similarly to the binary additive divisor prob- 
lem, the Large Sieve turns out to be crucial when one looks for asymptotic formula? for "mixed' correlations 
of d with some "reasonable" multiplicative arithmetic function (see [CO] , where the fundamental reference 
is Linnik's book [L]). Essentially the reason is that, due to the Dirichlet hyperbola method, the divisor 
function has "lever I < 1/2 of distribution in the arithmetic progressions, which is the same " 'large- sieve- 
barrier" for primes in arithmetic progressions, i.e. the celebrated Bombieri- Vinogradov Theorem. Note 
that the barrier I < 1/2 prevents one from applying this very strong tool to the correlations of the von 
Mangoldt function A, but it is harmless with the "truncated von Mangoldt" function A#, used in the break- 
through of Goldston-Pintz-Yildirim on small gaps between primes [GPY] . An alternate approach appeals to 
Duke-Friedlander-Iwaniec [DFI] bounds for bilinear forms of Kloosterman fractions instead of the Bombicri- 
Vinogradov theorem. It allowed the first author [C] to successfully establish asymptotic formulae for the 
correlations of essentially bounded functions / such that the Eratosthenes transform / * \i is supported 
up to O(x2 + i9o _£ ) and A#(n) with R -C x? + ~>~ e might be a remarkable example. However, this is a 
small improvement on the "level" for correlations, with respect to the aforementioned level in the arithmetic 
progressions given by the Large Sieve barrier. Sec the book [El] for the links between the concepts of level. 

A further possibility is open when / * fi vanishes outside of a very sparse set, where the "low density" 
has an actual effect on the level in arithmetic progressions. In this direction, we refer the reader to [BPW] 
and [To] on the k— free numbers, whose characteristic function is defined by 

d k \n q\n 

Indeed, it is plain that here the Eratosthenes transform g^ is supported on the k— th powers, a very "low- 
density" support. 

6. Crescendo: asymptotic formulae for d 3 in almost all short intervals. 

Here we turn our attention to non-trivial bounds for the Selberg integral of d 3 , postponing the general 
discussion on dk to next section. Ivic [Iv2] proved that the inequality 

J 3 (N,H) < N^H 2 

holds for the width > 1/6 with a neat exponent's gain E\ > 0. In other words, defining 9 > #3 as the range 
of the admissible width of the short interval for such an inequality, Ivic has proven 9 3 = 1/6. This is built 
upon the value 0-3 < 7/12, where at is the so-called Carlson's abscissa for the Riemann zeta 2fc-th moment, 

a k = f M{a G [0, 1] : £ \({<t + it)\ 2k At « T} 
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(see next section for some of the known values <Jk quoted from [IvO]). 

Conjecture CL provides improvements on Ivic's result since for every width 8 < 1/3 it yields the "best 
possible estimate", i.e. the square-root cancellation J 3 (N,H)<^NH, which in turn implies the optimal 
6 3 = through the arguments of §4. This estimate allows improvements on recent results [BBMZ] and [IW] 
for sums of correlations of d 3 . Further, it is worth to recall here that the lower bound J 3 (N, H) 3> NH log 4 N 
holds if < 6 < 1/3 (see [CO]). 

Now let us prove Corollary 1. 
Proof of Corollary 1. From the decomposition d 3 (n) = p 2 (\ogn) + d 3 (n) introduced in §3 one gets 

C 3 (/i) d = f d 3{n)d 3 (n -h)= d ^ m + = E d ^ n + + 'WW) = 

n~N N+h<m<2N+h n~N 

= ]T d 3 (n)d 3 (n + ft) + 2 ]T d 3 (n)p 2 (logn) + ^ p 2 (logn) 2 + O e (N s \h\) , 

n~7V n^N n^N 

where recall that cfe, ^3 are essentially bounded, while P2 (log n) is a quasi-constant function of n. Therefore, 
D 3 (N,H) = Y G 3 (h)-H ]T P2 (logn) 2 = ]T d 3 (n) ]T d 3 (m)+2H ]T d 3 (n)p 2 (\ogn) + O e (N" H 2 ). 

h<H n~N n~N n<m<n+H n~N 

By applying partial summation and (*) one has 
V d 3 {n)p 2 {\ogn) =p 2 (log(27V)) V d 3 (n) -I V d 3 (n) P ' 2 (1 ° g f) dt<« max I V d 3 (n) 

* //v t t<C2N I * 



Since the lower bound J 3 (AT, iJ) > TViJ log 4 AT holds at least for width < 6> < 1/3 (see [CO]), then Cauchy's 
inequality implies 



^ *(«) d 3(™)<« 



n~iV n<m<n-\-H 



N {T,\ E d 3 (m)-M 3 (x,H) +——^ y /NJ 3 (N,H) , 

£C~7V £c<m<:E+ff 



where recall that L = log TV and (see §3) 

M 3 {x,H) = Hp 2 {\ogx) = J2 P2(\ogm) + 0(x- 1 H 2 L) . 

x<m<x+H 

Thus, the conclusion follows immediately from Theorem 1. □ 
7. Main Theme: from all long intervals to almost all short intervals. 

The inductive identity dk = dk-i * 1 invites to explore a possible path in order to generalize our conjectures 
and results to each divisor function dk for k > 3 by infering formulas for dk in almost all short intervals from 
suitable information on dk-i in long intervals. However, the actual known values of the amplitudes ak seem 
to be a first serious bottle-neck. Further, while it might be comparatively easy to attack Conjecture CL, the 
path climbs up drastically when it comes to the general case k > 3. Although a general fc— folding method 
is available (see next Lemma 2, that is essentially the core of such a method), more and more technical 
problems are foreseeable as k increases. Besides, at the outset one has to face the problem of showing 
sufficient proximity of the analytic and the arithmetic forms of the mean value, at least by mean-square 
approximation, i.e. an inequality of the form 



\M k (x,H) - M k {x,H) «< N 1 - 2 /^ 2 , 



x~N 
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for a suitable choice of M k (x,H). Being unconceivable to give a rigorous general proof of such a proximity 
by direct calculations as we did for the case k = 3 in §3, at the moment we have to content oursclf with 
the following heuristic considerations. Some results of Ivic [Iv2] provide non-trivial estimates of the Selberg 
integral J k (N, H) with the mean value M k (x, H) assigned in the appropriate analytic form. Hence, Theorem 
2 legitimates the assumption that the arithmetic mean value M k (x,H) is close to the analytic counterpart 
M k (x, H) for every k > 3 in the ranges of the short interval width 9 provided by Ivic's results. Actually, we 
know that the analytic form is H times a k— 1 degree polynomial in logcc, the same shape that approximates 
(see next Lemma 2) the arithmetic form. Then, comparing Ivic's results with ours in a common range for 
9, we easily conclude that these polynomials must coincide (Amitsur formula [A] gives polynomials' degree 
and Tun's Lemma [Tu] the remainders). 

Now, let us turn our attention to next Lemma 2, where we show the so-called "fc— folding method". At this 
aim, assuming that H does not depend on x explicitly, let us consider for any fixed k > 2 the weighted sum 

S k (x, H) d = a k (n)w(n - x) = ^ ■■■ ^ q(m) ■ ■ ■ a(n k )w(n 1 ■■■n k -x) , 

n m nk 

def 

where a k = a* ■ • • * a is the A:— fold Dirichlet product of an arithmetic function a : N — > C and the weight 

k times 

w : N — > C is supposed to be uniformly bounded in its support, that is contained in [—H,H]. 
Further, for M d = [(N - H f/ k ] let us denote 

9k(q) = g k (q,a,M) d = a k -i{q) + Y a i-i(<?) ' 

j<k-i 

where 

4-i(5) = E • • • E ■ ■ ■ a ^-i) vj < * - 1 . 
"i "fc-i 

"i---"fc-i=a 
»i "j< M 

Lemma 2. If the arithmetic function a is essentially bounded, then Ve > and uniformly for every x <~ N, 
S k (x,H)= E 9k{q) E a w ^-^ +O ^{ N ij^ + W + 1 )) ■ 

q<x/M 0<\n-x\<H q 

7i=0(raod q) 

Proof. First note that n\ ■ ■ ■ n k > x — H > N — H implies that at least one of n\, . . . ,n k has to be > M. 
Then, let us define the partial sums So, Si, • • • , ^ k -i of S k (x, H) as follows: 

50 is the part of S k (x,H) corresponding to ni > M , 

51 is the part of S k (x,H) — S corresponding to n 2 > M, 

52 is the part of S k (x, H) — So — Si corresponding to 113 > M, and so on. 

Therefore, we set a k a \(q) d = a k -\{q) and split S k {x,H) as 

S fc (x,#) = S + Si + ... + S fe _i = 

= E "fc"-!^) E a(m)w(n-x)+ E 4-1 (?) E a(m)w{n - x)+ 

<1<— — m>M 9<^- m>M 

H h E 4-i 1) (9) E a(m)w(n-x)= E 9k(q) E a(m)w(n - x) , 

X + H n = qm x + H n = qm 

M m>M gS M m>M 

where for each j = 0, 1, . . . , k — 1 the multiple sum 

4-i(«)=E "' E o(m)---aK-i) 

"l---"fc-l=9 
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has j variables restricted by M (that depends on N, H, k, but not on x). 

' fc-i 



Observe that, since g k {q) — Y^ a k-i(l)> tnen IfffeC?)! — k\ a \k-i( ( l) < ^ 1> wnere we se t l a lfc_i = |a| * ■ ■ ■ * \a\. 
Thus, 



k — 1 times 



S k (x,H)= Y 9k(q) Y a{m)w{qm - x) 



m>M 



= Y 9k{q) Y a(m)w(qm - x) +O fc , £ ( Y Y = 

= Y 9k ^ Y a(m)w(qm- x) + O k , e ( Y Y x£ ) = 

q<x/M £^< ro <*±» ^«,<^ h-f |<f 

= E E a(^)w(n-x)+0^{N e Y Y X ) • 

q<x/M O^n-^H ^ <9 <^| m _ f |< f 

x — H N 

Since g > — >• — uniformly as TV < a; < 2 AT, then the O k e -term is 

MM ~ ~ 

H \ f H \(HM \ AT£ /H 2 H 



— — <QS—m~ 



An application of Lemma 2 to attack the General Conjecture CL would require a = 1, so that a k = d k , 
attached to the Cesaro weights. 

Remark. The General Conjecture CL together with Theorem 2 justifies the title of the present section: 
we positively interpret the (upper) bounds for the amplitude a k -\ as a property which holds in every long 
interval, while the consequent bound of J k clearly means an almost all short interval property. However, 
since the present state of knowledge assigns values to a k that grow towards 1 with k, the quality of the 
General Conjecture CL and Theorem 2 ruins unrelentingly for large values of k. Of course, as already seen 
when k = 3, the general scheme of implications of non-trivial estimates discussed in §4 still works in suitable 
ranges of short intervals, namely when the width of the short interval is sufficiently large with respect to a k . 
In particular, one easily proves Corollary 2 on the deviation of d k (see the definition in §0) for any k > 3 by 
closely following the Corollary 1 proof with the aid of the arguments in §4. 

Note that we have an improvement on the aforementioned Ivic's results on the Selberg integral J k (N, H) 

for k > 3. More precisely, 1 
Corollary 2 of [Iv2] one has 



def 

for k > 3. More precisely, Theorem 1 in [Iv2] holds for 6 e (0 k , 1) with 9 k = 2a k — 1. In particular, from 



ft 1 - n ft - 3 

04 ~ 4' ° 5 "30' 9e ~ 7 ' 

respectively corresponding to the following known upper bounds [IvO] for Carlson's abscissae, defined in §6, 

5 41 5 

(74 < — , (7k < , (7fi < — . 

8' ~ 60' 7 

Due to Corollary 2 such values of 9 k are superseded by 



il ft -I ft -I 
128' 5 5' 6 3 ' 



which follow from the well-known upper bounds for the amplitudes (see [Ti]) 

"3 < ^ (Kolesnik, 1981), a 4 < \ (Hardy-Littlewood [HL], 1922), a 5 < ^ ([IvO], Ch.13) . 
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It might be possible that 9k < 9k for any k > 6, although we think that it is unlikely. However, we put our 
emphasis essentially on the method: unlike Ivic, we do not use any deep property of the Riemann £— function 
(at least not explicitly) and rely uniquely upon known values of the exponents oik\ but one should not forget 
that most known values ak follow directly from non-trivial estimates of the moments of the Riemann zeta 
function on the line (for a clear digression on such a topic see the wonderful book by Ivic [IvO]). On the 
other side, as the first author pointed out (see [C2], Theorem 1.1), estimates of the Selberg integral of dk 
have non-trivial consequences on the 2k— th moments of the Riemann zeta function. Thus, recalling that 
Kolesnik has found his bound for a 3 without the aid of the 6— th moments and that ak known values go to 
1, it is plain that, at least for relatively low values of k, Theorem 2 confirms that the Selberg integrals of 
dk and the 2k— th moments of the Riemann zeta function are connected by a circle route. Next section is 
devoted to a further discussion on the argument. 

8. Finale: conditional bounds for the moments of ( on the critical line. 

As already mentioned in the previous section, at least in theory we could draw some consequences of Theorem 
2 on the moments of the Riemann zeta function. However, our method does not lead to a better result than 
those available in the literature for the 2k— th moment when 4 < k < 6 and maybe the scenario is even 
worst when k > 6. The reason is essentially the bound iV 1_1//fe _ff 2 in the General Cojecture CL, that is 
an unavoidable barrier term. It transfers from Jk(N,H) to a bound of the 2k— th moment, via the Selberg 
integral Jk(N,H), whenever we appeal to Theorem 1.1 of [C2]. 

Here we take the opportunity of applying Theorem 1.1 of [C2] to give some conditional bounds, depend- 
ing on estimates for the Selberg integrals Jk which are proved for k = 3, but unproved for k > 3. At this 
aim, let us define the excess Ek as a real number such that 



With an application of Holder's inequality from the well-known values E 2 = and E & = 1 (from Heath- 
Brown's bound I 6 (T)<^T 2 ) one gets 



According to this definition, Theorem 3 gives E k = ^{A + B)-B whenever J k (N, iJ)«S N 1+A H 1+B holds 
for H <C N x ~ 2 / k . In particular, as already mentioned, it gives E 3 = 1/10, when combined with recent [C5]. 

PROOF of Theorem 3. In [C2] the Selberg integral of d k is defined in the standard way as 



It is easy to see that a dyadic argument allows to replace [hx £ ,x] by dyadic intervals like [N,2N], and the 
substitution of the integral on [N, 2N] by Jk(N 7 H) generates only negligible remainder terms. Hence, we 
may apply Theorem 1.1 of [C2] by using Jk(N,H) instead of the above integral and Ve > small we get 



9. Epilogo: the best unconditional exponent of \(\ 6 mean on the line. 

As we saw in the introduction, a further immediate consequence of the recent result [C5] is that we have an 
elementary deduction of S3 = 1/10 just taking A = and B = 1/5 in Theorem 3 for H <C N 1 ^ 3 . However, 
the proof of Theorem 1.1 [C2] uses the approximate functional equation for ( k (and Gallagher's Lemma), 




E 3 = 1/4, £4 = 1/2, E 5 = 3/4 . 




4(T)<g;T + T max 
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so the whole study actually proving E3 = 1/10 (in [C5] Gallagher's Lemma is also applied) is just shorter, 
rather than elementary, with respect to the Heath-Brown's bound of Iq. 

Furthermore, see Corollary of [C5], our new approach based on the "modified Gallagher Lemma", 
contained in the forthcoming paper [CL], assures under Conjecture CL the even better excess E3 = for the 
Rlemann zeta function, namely the well-known weak 6—th moment 

The excesses Ek here (k > 3) are from a 2nd generation approach, while Ivic's [Ivl] is a 1st generation one. 
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